Application No. 09/8 1 1 ,762 

Reply to Office Action of May 30, 2007 

Amendments to the Claims: 

This listing of claims will replace all prior versions and listings of claims in the 
application. 

Listing of Claims: 

1 . (Previously Presented) A method for analyzing a plurality of sets of values 
associated with a plurality of genes to identify genes whose associated values differ by an 
amount of statistical significance among the sets, said associated values comprising levels 
of mRNA or protein, said associated values acquired by a process where biological 
samples containing said plurality of genes are hybridized to one or more microarrays of 
probes, thus measuring the levels of mRNA or protein in the biological samples, wherein 
the method comprises: 

providing for each of the plurality of genes a parameter that contains information 
concerning differences in the associated values of that gene among the sets; 

adjusting the parameters of the plurality of genes so that variables related to the 
parameters are substantially independent of variations of scatter values or average 
associated values of the genes over the sets, said scatter values defined by the standard 
deviation of the associated values in the sets; 

deriving an observed value and an expected value of the adjusted parameter for 
each gene from the sets of associated values, said expected value being indicative of the 
extent of variations in the adjusted parameter introduced by the process; 

comparing the observed and expected values of the parameter to identify genes 
whose associated values differ by an amount of statistical significance among the sets; 
and 

providing a list of genes whose associated values differ by an amount of statistical 
significance among the sets. 
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2. (Previously Presented) The method of claim 1, wherein said adjusting includes: 
dividing the scatter values or average associated values of the genes into subsets 

each having a similar range of values, and calculating the standard deviation of each of 
the parameters within each subset; 

altering the parameters until a coefficient of variation of the standard deviations of 
the parameters among the subsets is minimized. 

3. (Previously Presented) The method of claim 1, further comprising obtaining 
said sets of associated values from multiple measurements of the plurality of genes, or 
values derived therefrom. 

4. (Original) The method of claim 1, wherein said sets of associated values 
represent gene expression or number of gene copies or levels of protein encoded by the 
genes. 

5. (Original) The method of claim 1, wherein said sets of associated values 
include calculated or predicted values. 

6. (Currently Amended) The method of claim 1, wherein said providing a 
parameter includes calculating a difference value between an associated value of each 
gene in a first of the sets or a value derived therefrom and an associated value of that 
gene in a second of the sets or a value derived therefrom; and 

wherein the parameter is a function of the difference value of that gene. 

7. (Previously Presented) The method of claim 6, wherein said providing a 
parameter further includes: 

generating for each of the plurality of genes a scatter value that quantifies 
variation in the associated values of that gene within the first and second sets; and 

wherein said parameter is a function of the scatter value and of the difference 
value, said parameter defining a relative difference value of that gene. 
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8. (Previously Presented) The method of claim 7, wherein said generating 
employs the following equation: 

S(i) = ({Va}{£ [*m(0 - SCO]' [*n(0 -^(0] 2 }) 1/2 

where gene (i) has associated values xi(i) and xu(i) in Ith and Uth states 
respectively in the first and second sets of associated values, I and U being positive 
integers; £ m and £ n are sums over associated values of gene (i) in states I in the first set 
and in states U in the second set respectively, where s(i) is the scatter value of gene (i), 
and a is a constant. 

9. (Previously Presented) The method of claim 8, wherein said calculating 
calculates the parameter d(i) from the following equation: 

d(i) = \xj(() - xjj(i)]/[s(.0 +s 0 ] 
where So is a constant, and x t (i) and Xu (0 are the average values of xi(i) and xu(i) 
respectively in the first and second sets of associated values. 

10. (Previously Presented) The method of claim 9, said adjusting comprising: 
dividing the scatter values or average associated values of the genes into subsets 

each having a similar range of values, and calculating the standard deviation of each of 
the parameters within each subset; and 

altering value of So until a coefficient of variation of the standard deviations of the 
parameters among the subsets is minimized. 

1 1 . (Previously Presented) The method of claim 1 , wherein said associated values 
of the genes are correlated with another variable so that each of said associated values has 
a corresponding value of the variable, and wherein the parameter is provided using a 
Pearson correlation coefficient related to a weighted difference between each of the 
associated values and an average associated value, the variance of the associated values 
and the variance of the variable, said difference weighted by the deviation of the 
corresponding value of the variable of such associated value from its average value. 
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12. (Original) The method of claim 11, wherein said variable is continuous. 

13. (Original) The method of claim 12, wherein said variable is time. 

14. (Previously Presented) The method of claim 11, wherein the parameter is 
selected using the Pearson correlation coefficient and a quantity So that has a value 
adjusted in said adjusting as follows: 

dividing the scatter values or average associated values of the genes into subsets 
each having a similar range of values, and calculating the standard deviation of each of 
the parameters within each subset; and 

altering the value of So until a coefficient of variation of the standard deviations of 
the parameters among the subsets is minimized. 

15. (Previously Presented) The method of claim 1 1, the number of sets of 
associated values being k, k being a positive integer, wherein said Pearson correlation 
coefficient r(i) is given by: 

r(o = ^ K**(0 - £(0)] [(y* - y)] / JZ fc (**(0-*(0) 2 £ k (y*-yy 

where x k (i) is the associated value of gene (i) in the kth set of associated values, 
x(i) the average of the associated values of gene (i) in all the sets, yk the value of the 
variable corresponding to Xk(i), y the average value of yk in all the sets, and £ fc is a sum 
over all values of k. 

16. (Previously Presented) The method of claim 1, wherein the associated values 
in each set are classified into two or more subsets with values in each subset having a 
correlation with one another, and wherein the parameter is selected using a quantity 
related to the variances between the associated values in the subsets of the sets and the 
variances of the associated values within each subset of the sets. 
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17. (Previously Presented) The method of claim 16, wherein the quantity relates 
to the sum of the variances between the associated values in the subsets of the sets and 
the sum of the variances of the associated values within each subset of the sets. 

18. (Previously Presented) The method of claim 17, wherein the parameter is 
selected using the Fisher discriminant and a quantity So having a value which has been 
adjusted in said adjusting as follows: 

dividing the scatter values or average associated values of the genes into subsets 
each having a similar range of values, and calculating the standard deviation of each of 
the parameters within each subset; and 

altering value of So until a coefficient of variation of the standard deviations of the 
parameters among the subsets is minimized. 

19. (Previously Presented) The method of claim 18, wherein the number of 
subsets of associated values of such set being k, k being a positive integer, and the Fisher 
discriminant F(i) is given by: 

F(0 = Z fc nk[Xk(0 "- (0]2/ Z fc Z; l X j®-Z*W 2 

where x k (i) is an associated value of gene (i) in the kth subset of associated 
values, Xk(0 me average of the associated values of gene (i) in the kth subset, x(i) the 
average value of the associated values of gene (i) in all of the subsets, n k the number of 
associated values in the kth set, a sum over all the associated values of gene (i) in the 
kth subset, and £ fc a sum of the associated values of gene (i) over all of the subsets. 

20. (Previously Presented) The method of claim 1, the sets of associated values 
referred to as original sets, wherein said deriving includes deriving said expected value 
by: 

permuting, for each of the plurality of genes, the associated values for such gene 
in the original sets to arrive at a number of different permutations; 

classifying the associated values in each permutation of each gene into 
corresponding permuted sets that are different from the original sets; and 
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supplying for each permutation a parameter value of each of the genes derived 
from an associated value of such gene in each of the corresponding permuted sets for 
such permutation or values derived therefrom. 

21. (Previously Presented) The method of claim 20, wherein said associated 
values of the genes are correlated with another variable so that each of said associated 
values has an associated value of the variable, wherein the permuting permutes the 
associated values so that at least each of some of the associated values has a different 
associated variable. 

22. (Currently Amended) The method of claim 21 , wherein the associated values 
are classified into two or more subsets with values in each subset having a correlation 
with one another, wherein the permuting permutes the associated values so that at least 
each of some of the associated values is in a subset different from the subset it is 
classified int\e into . 

23-27. (Cancelled) 

28. (Previously Presented) A method for analyzing a plurality of original sets of 
values associated with a plurality of genes to identify genes whose associated values 
differ by an amount of statistical significance among the sets, said associated values 
comprising levels of mRNA or protein, said associated values acquired by a process 
where biological samples containing said plurality of genes are hybridized to one or more 
microarrays of probes, thus measuring the levels of mRNA or protein in the biological 
samples, wherein the method comprises: 

calculating for each of the plurality of genes a value for a statistical parameter 
indicating differences between associated values of such gene among the original sets; 

ranking the values of the parameter of the genes; 

providing an expected value of such parameter for each rank, wherein said 
providing includes permuting the associated values in the original sets to arrive at sets 
different from the original sets for each permutation, deriving a value of such parameter 
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for each permutation, and ranking such values, said expected value for each rank being 
indicative of the extent of variations in the parameter for parameters in said rank 
introduced by the process; 

comparing the calculated and expected values for the parameter of the same rank 
to identify genes whose associated values differ by an amount of statistical significance 
among the sets; and 

providing a list of genes whose associated values differ by an amount of statistical 
significance among the sets. 

29. (Previously Presented) The method of claim 28, wherein said providing 
comprises: 

for each permutation, deriving a value of the parameter for each gene and ranking 
the genes by their associated parameter values; and 

determining the expected value of such parameter for each rank by computing an 
average value of the parameter of all the permutations having such rank. 

30. (Previously Presented) The method of claim 29, wherein said comparing 
comprises identifying a gene as one whose associated values differ by an amount of 
statistical significance among the sets when the difference for such gene between the 
calculated value of the parameter of a rank and the expected value of such parameter of 
the same rank exceeds a threshold. 

31-32. (Cancelled) 

33. (Original) The method of claim 28, wherein the sets of associated values in 
each permutation contains approximately an equal number of associated values from each 
of the original sets of associated values. 

34-43. (Cancelled) 
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44. (Previously Presented) A computer readable storage device embodying a 
program of instructions executable by a computer to perform a method for analyzing a 
plurality of sets of values associated with a plurality of genes to identify genes whose 
associated values differ by an amount of statistical significance among the sets, said 
associated values comprising levels of mRNA or protein, said associated values acquired 
by a process where biological samples containing said plurality of genes are hybridized to 
one or more microarrays of probes, thus measuring the levels of mRNA or protein in the 
biological samples, wherein the method comprises: 

providing for each of the plurality of genes a parameter that contains information 
concerning differences in the associated values of that gene among the sets; 

adjusting the parameters of the plurality of genes so that variables related to the 
parameters are substantially independent of variations in scatter values or average 
associated values of the genes over the sets, said scatter values defined by standard 
deviation of the associated values in the sets; 

deriving an observed value and an expected value of the adjusted parameter for 
each gene from the sets of associated values, said expected value being indicative of the 
extent of variations in the adjusted parameter introduced by the process; 

comparing the observed and expected values of the parameter to identify genes 
whose associated values differ by an amount of statistical significance among the sets; 
and 

providing a list of genes whose associated values differ by an amount of statistical 
significance among the sets. 

45. (Cancelled) 

46. (Previously Presented) A computer readable storage device embodying a 
program of instructions executable by a computer to perform a method for analyzing a 
plurality of original sets of values associated with a plurality of genes to identify genes 
whose associated values differ by an amount of statistical significance among the sets, 
said associated values comprising levels of mRNA or protein, said associated values 
acquired by a process where biological samples containing said plurality of genes are 
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hybridized to one or more microarrays of probes, thus measuring the levels of mRNA or 
protein in the biological samples, wherein the method comprises: 

calculating for each gene a value for a statistical parameter indicating differences 
between associated values of such gene among the original sets; 

ranking the values of the parameter of the genes; 

providing an expected value of such parameter for each rank, wherein said 
providing includes permuting the associated values in the original sets to arrive at sets 
different from the original sets for each permutation, deriving a value of such parameter 
for each permutation, and ranking such values, said expected value for each rank being 
indicative of the extent of variations in the parameter for parameters in said rank 
introduced by the process; 

comparing the calculated and expected values for the parameter of the same rank 
to identify genes whose associated values differ by an amount of statistical significance 
among the sets; and 

providing a list of genes whose associated values differ by an amount of statistical 
significance among the sets. 

47-57. (Cancelled) 

58. (Previously Presented) A computer system for analyzing a plurality of sets of 
values associated with a plurality of genes to identify genes whose associated values 
differ by an amount of statistical significance among the sets, said associated values 
comprising levels of mRNA or protein, said associated values acquired by a process 
where biological samples containing said plurality of genes are hybridized to one or more 
microarrays of probes, thus measuring the levels of mRNA or protein in the biological 
samples, wherein the system comprises: 

one or more computers; 

one or more computer programs running on the computer(s), performing the 
following: 

providing for each of the plurality of genes a parameter that contains information 
concerning differences in the associated values of that gene among the sets; 
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adjusting the parameters of the plurality of genes so that variables related to the 
parameters are substantially independent of variations in scatter values or average 
associated values of the genes over the sets, said scatter values defined by standard 
deviation of the associated values in the sets; 

deriving an observed value and an expected value of the adjusted parameter for 
each gene from the sets of associated values, said expected value being indicative of the 
extent of variations in the adjusted parameter introduced by the process; 

comparing the observed and expected values of the parameter to identify genes 
whose associated values differ by an amount of statistical significance among the sets; 
and 

providing a list of genes whose associated values differ by an amount of statistical 
significance among the sets. 

59. (Cancelled) 

60. (Previously Presented) A computer system for analyzing a plurality of original 
sets of values associated with a plurality of genes to identify genes whose associated 
values differ by an amount of statistical significance among the sets, said associated 
values comprising levels of mRNA or protein, said associated values acquired by a 
process where biological samples containing said plurality of genes are hybridized to one 
or more microarrays of probes, thus measuring the levels of mRNA or protein in the 
biological samples, wherein the system comprises: 

one or more computers; 

one or more computer programs running on the computer(s), performing the 
following: 

calculating for each gene a value for a statistical parameter indicating differences 
between associated values of such gene among the original sets; 
ranking the values of the parameter of the genes; 

providing an expected value of such parameter for each rank, wherein said 
providing includes permuting the associated values in the original sets to arrive at sets 
different from the original sets for each permutation, deriving a value of such parameter 
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for each permutation, and ranking such values, said expected value for each rank being 
indicative of the extent of variations in the parameter for parameters in said rank 
introduced by the process; 

comparing the calculated and expected values for the parameter of the same rank 
to identify genes whose associated values differ by an amount of statistical significance 
among the sets; and 

providing a list of genes whose associated values differ by an amount of statistical 
significance among the sets. 

61-66. (Cancelled) 
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